Discrete utterance speech recognition without time alignment
نویسندگان
چکیده
منابع مشابه
Within-utterance correlation for speech recognition
Relations between non-adjacent parts of an utterance are commonly regarded as an important source of information for speech recognition. However, they have not been very much used in speech recognition systems. In this paper, we include this information by joint distributions of pairs of phones occurring in the same utterance. In addition to relations between acoustic events, we also have incor...
متن کاملWithin-utterance correlation in automatic speech recognition
Information on relations between separate parts of an utterance can be used to improve the performance of speech recognition systems. In this paper, examples of relations are discussed and some measured data on phone pair correlation is presented. In addition to relations between acoustic events in an utterance, it is also possible to represent relations between acoustic and non-acoustic inform...
متن کاملUtterance verification based speech recognition system
Many existing search algorithms aim at searching for the best hypothesis from all possible hypotheses with the help of techniques like beam search to reduce the computational cost. These search algorithms are based on the competitive criteria because the best hypothesis is determined after we have the knowledge of all other possible hypotheses. In this paper, we investigate the possible use of ...
متن کاملTime- and Acoustic-Mediated Alignment Algorithms for Speech Recognition Evaluation
The paper investigates the timeand acoustic-mediated alignment algorithms that can be used for better speech recognition evaluation. The edit-cost function, which weights the cost of speech unit matches, substitutions, deletions and insertions, is defined as a function of timed symbols or even as a function of speech signal segments. The algorithms are compared using several classical statistic...
متن کاملSpeech segmentation without speech recognition
In this paper, we presented a semantic speech segmentation approach, in particular sentence segmentation, without speech recognition. In order to get phoneme level information without word recognition information, a novel vowel/consonant/pause (V/C/P) classification is proposed. An adaptive pause detection method is also presented to adapt to various background and environment. Three feature se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Information Theory
سال: 1983
ISSN: 0018-9448
DOI: 10.1109/tit.1983.1056716